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Abstract. Algorithms to compute the quantum Fourier transform over a 
cycHc group are fundamental to many quantum algorithms. This paper de- 
scribes such an algorithm and gives a proof of its correctness, tightening some 
claimed performance bounds given earlier. Exact bounds are given for the 
number of qubits needed to achieve a desired tolerance, allowing simulation of 
the algorithm. 



1. Introduction 

Most quantum algorithms giving an exponential speedup over classical algo- 
rithms rely on efficiently computing Fourier transforms over some finite group 
m, |, [lO[ |llHl|. The Abehan group case depends on fast quantum algorithms 



for doing Fourier transforms over cyclic groups 14 . The thesis 0| and the 

paper ||^ describe such a quantum algorithm, but the proofs are incorrect. This 
note attempts to correct those proofs, and in the process obtains stronger bounds 
for many of their results, and a few weaker ones. The end result is a proof of the 
correctness of their algorithm, with concrete bounds suitable for quantum simu- 
lation instead of the asymptotic bounds listed in their papers. Our final result is 



theorem 16 



2. Preliminaries 

Efficient algorithms for the quantum Fourier transform over finite Abelian groups 
are constructed from the algorithms for the transform over cyclic groups, which in 
turn reduce to computing the transform efficiently over prime order groups iQ, p^ . 
Efficient algorithms for computing the quantum Fourier transform over a cyclic 
group of order 2™ for a positive integer m are well known, and are used in Shor's 
factoring algorithm: see Coppersmith and Shor |6j for example. We will 
show an algorithm for computing the quantum Fourier transform over an odd order 
cyclic group. The algorithm (containing minor errors) is given in [Q and but 
their proofs of the correctness of the algorithm and subsequent performance bounds 
are incorrect. This paper corrects those proofs and obtains new bounds. For 
applications of this algorithm, see 0| and A proof of similar ideas using a 
different method is in [Bl. 



Date: March 2004. 

2000 Mathematics Subject Classification. 03D15, 42A85, 68Q05, 68W40, 81P68. 
Key words and phrases, algorithms, quantum computers. Hidden Subgroup Problem, quantum 
Fourier transform, cyclic quantum Fourier transform. 

Research supported by AFRL grant F30602-03-C-0064. 

1 



2 



CHRIS LOMONT 



2.1. Notation and basic facts. We fix three integers: an odd integer N > 3, 
L > 2 a power of 2, and M > LN a power of 2. This gives (M, N) = 1, which we 
need later. 

Some notation and facts to clarify the presentation: 

• \/— T will be written explicitly, as i will always denote an index. 

• For an integer n > 1, let cj„ = g^-^V^/n (;Jenote a primitive n^^ root of 
unity. 



Fact: 



1 



< \6\ as can be seen from arc length on the unit circle. 

Thus for real values a we 



If -TT < 61 < TT wc also^ have | f | < 1 - e^^ 

have \l~ujfi\ < |2f^|, etc. 
logn denotes log base 2, while Inn is the natural log. Since M and L are 
powers of two, [logAf] = [logAfJ = [logAf] = logM, and similarly for L, 
but we often leave the symbols to emphasize expressions are integral. 
For a real number x, \x~\ is the smallest integer greater than or equal to 
X, [x\ is the largest integer less than or equal to x, and [a;] is the nearest 
integer, with ties rounding up^. We often use the three relations: 

1,1 1 

x~-<lx\<x + - 

x--l<lx\<x 
x < \x~\ < a; + 1 

Indices: i and s will be indices from 0,l,...,iV — 1. j will index from 
0, 1, . . . , i — 1. k will index from 0, 1, . . . , Af — 1. a and b will be arbitrary 
indices, t will index from a set Cg, defined in definition ^ below. 
Given i € {0, 1, . . . , iV — 1}, let i' = denote the nearest integer to j^i 

with ties broken as above. Similarly for s and s' . Note < i' < M — 1. 
For a real number x and positive real number n, let x mod n denote the 
real number y such that < y < n and y = x + mn for an integer m. 
Note that we do not think of x mod n as an equivalence class, but as a real 
number in [0, n). 

\u) and \v) are vectors in spaces defined later, and given a vector \u) denote 
its coefficients relative to the standard (orthonormal) basis {|0), |1), . . . , |n— 
1)} by uo,"!, • ■ • etc. 
For a real number x, let 



I A/ 



X mod M if < (a; mod M) < f- 

—X mod M otherwise 



Thus < |a::|j\/ < Properties of this function are easiest to see by noting 
it is a sawtooth function, with period M, and height M/2. 
For an integer s set (5^ = [^s] - ^^s. Then \Ss\ < i. 

The (unitary) Fourier transform over a cyclic group of order N is denoted 
Fn- Thus if |w) = Ejlo^ then Fn\u) ^ ^ E^s^o u>.^n\s)- We write 
|u) ~ Fn\u), with coefficients Ui. 
EjIo^ = 1 implies J2t M < 



^This range can be extended slightly. 

^We could break ties arbitrarily with the same results. 
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The majority of the errors in 0| and |^ resuhed from misunderstanding the 
consequences of their versions of the following two definitions. The first defines sets 
of integers which will play an important role: 

Definition 1. For j = 0, 1, . . . , iV — 1, let (i) denote the set of integers in the open 
interval ^ ^ + + ^ ^ 5) taken mod M . Recall i' = [^*] • 

The second definition we make precise is a division and remainder operation: 

Definition 2. Given M, N as above. Set a = + ij , and [3 = - |] . We 
define the map A : {0, 1, . . . , il/ — 1} ^ {0, 1, . . . , iV — 1} x {—a, —a + 1, . . . , a} , as 

follows: for any k G {0, 1, . . . , M — 1}, let k ^ (s, t), via 

N' 



k' = 



N 



t = k- 
s = k' mod 

We extend this definition to a transform of basis elements \k) via 

A\k) = \s)\t + a) 

and extend to all vectors by linearity. 

Finally, from the image of A, define Cs = { t | (s, G Image A} to be those 

values oft appearing for a fixed s. Thus X^fclo^ l'^) X^fL'o^ X^tec ^ '^)- 

We will show the integers {— /3, ■ • ■ , /?} C Cs C {—a, . . . , a} for all s, which is 
why we defined f3 with the A definition, a and (3 remain fixed throughout the 
paper. 

For the proofs to work, we need that the sets (i) are disjoint and have the same 
cardinality. Neither |0] nor |^ define these sets mod M, although perhaps it is 
implied. 0| makes a similar definition without the ^ terms, but the resulting sets 
are not then disjoint. Q makes a similar definition, but uses ^i instead of i' 
and drops the ^ terms, which results in sets of varying cardinality. To see the 
differences, check the three definitions using M = 32 and iV 5. Both § and 
|8| implicitely assume their resulting sets are disjoint and of constant cardinality in 
numerous places, invalidating many proofs. Note also that the mod M condition 
gives M — 1, S (0) when M > 3N. We now show that the sets defined here have 
the required properties: 

Lemma 3. For ii 7^ 12 G {0, 1, . . . , iV - 1} , 

(1) = |fe)| 

(2) {n)f]{^2) - 

Proof. Each set is defined using an interval of constant width, centered at an integer, 
so the sets will have the same cardinality. To show disjointness, for any integer a, 
take the rightmost bound Ra — [^f a] ~^ ~ 1 of an interval and compare it to 
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the leftmost bound La+i 



M 
2N 



i of the next intervah 



La-\ 



Ra — 



> 



M 

— a - 
M 



M 
— ( 

N 



M 

'n 



- -TT + 1 



AI 



1\ M 

2 -iV+^ 



(3) 

(4) 

(5) = 

giving that the open intervals are disjoint. Thus taking the integers in the intervals 
mod M remains disjoint (which requires ii,i2 '^^ N — \). □ 

The second error which propagates throughout the proofs in Q and |^ stems 
from misconceptions about the division operation A. Both papers treat the image 
of the A map as a cartesian product, that is, the range on t is the same for all 
values s {M = 32 and = 5 illustrates how this fails to give a bijection with their 
definitions). However, the image is not a cartesian product; the values t assumes 
depend on s, otherwise we would have that M is a multiple of N. In other words, 
the cardinality of Cg depends on s, with bounds given in the following lemma, 
where we show that our definition works and list some properties: 

Lemma 4. Using the notation from definition ^ 

1 ) the map A is well defined, and a bijection with its image, 

2) a = P+l, 

3) the sets of integers satisfy {—/?,..., /?} C Cg C {—a,..., a} for all s G 
{0,1,...,7V-1}. 



Proof. Given a fc in {0, 1, . . . , M - 1}, let A(fc) = 
a = + To check that —a < t < a, note 



{s,t). Clearly 0< s < iV-1. Set 



(6) 



givmg 



N, I ,, N , 1 
— k~ - < fc < -T-rk+7: 
M 2 - - M 2 



(7) 



M 



1 

. - > t 
2N 2 - 



> 



M 
2N 



and t integral allows the rounding operation. Thus the definition makes sense. 

Next we check that both forms of A in the definition are bijections. Suppose 
ki ^ fc2 are both in {0, 1, . . . , M — 1}, with images A{kr) = {sr,tr), r = 1,2. Let 
fc; = [§kr] , r = 1, 2. Note < fc; < N. 

Assume (si,ti) = (s2,t2)- If k[ = fcj, then 



(8) 
(9) 



fci 



^fc2 



N 1 



= fci - 



AT ^2 



M 



a contradiction. So we are left with the case k[ ^ k^- In order for si = si we 



have (without loss of generality) k'^ — 0, fc'2 



M < M -\- M 



N. 



-1, a contradiction. 



But then ii = fci > and 
Thus A in the first sense is a 



^2 = fc2 _ 
bijection. 

The second interpretation follows easily, since —a < t < a gives < t + a < 2a. 
So the second register needs to have a basis with at least 2a + 1 elements, which 
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causes the number of qubits needed"^ to implement the algorithm to be [log M] + 2 
instead of [log M~\ . 

To see a ~ /3 + 1, bound a — f3 using the methods above, and^ one obtains 
2 > a - /3 > 0. 

All integers between [^(s + 1)] and \J^s\ inclusive must be of the form ti + 
[^s] for ti S Cs or of the form t2 + [^f (s + 1)] for 1-2 G Cg+i- This range contains 
\Jw{^ + 1)1 ~ \Jw^'\ +1 — 77' integers, and at most a + 1 of these are of the form 
<2 + Llf + 1)1 with <2 e Cs+i- This leaves at least [^1 - " > 5^ - | that have 
to be of the form ti + \J^s\ with ti G C,, implying (3 <E Cs- Similar arguments give 
±P e C, thus {-/3, ...,P}CCsQ {-a, . . . , a} for all s. □ 

A is efficient to implement as a quantum operation, since it is efficient classically 
m Chapter 4]. Finally we note that A, being a bijection, can be extended to a 
permutation of basis vectors | k) , thus can be considered an efficiently implementable 
unitary operation. 

We define some vectors we will need. For i £ {0,1, . . . , N — 1} define 

M-lLN-1 

\B^) = \A^) restricted to integers in the set {i) 

LN-1 
be{i) a=0 

|T') = restricted to integers outside the set (i) 

= \A')-\B') 

LN-l 
^ b^{i) a=0 

Think A* for actual values, for bump functions, and for tail functions. Note 
that the coefficients El and are just A\ for h in the proper ranges. 

We also define three equivalent shifted versions of 1-8°). Note that to make these 
definitions equivalent we require the sets (i) to have the same cardinality. Let 
1^') - E6e(0) Bt\b + i') = Efce(o) Al\b + *') = E.6(i) where each b ± z' 

expression is taken mod M. The \S^) have disjoint support, which follows from 
lemma ^, and will be important for proving theorem 



•^This is proven in theorem 

^(M, N) = 1 is used to get the strict inequalities. 
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3. The Algorithm 

The algorithm takes a unit vector (quantum state) \u) on [log A^] qubits^, does 
a Fourier transform F^, L a power of two, on another register containing |0) with 
[log M] — [log A^] + 2 qubits, to create® a superposition, and then reindexes the basis 
to create L (normalized) copies of the coefficients of \u), resulting in \ul). Then 
another power of two Fourier transform Fm is applied. The division A results in a 
vector very close to the desired output Fn\u) in the first register, with garbage in 
the second register (with some slight entanglement). The point of this paper is to 
show how close the output is to this tensor product. We use [log A/] + 2 qubits, 
viewed in two ways: as a single register |fc), or as a [log TV] qubit first register, with 
the remaining qubits in the second register, written |s)|t). We note that merely 
[log M] qubits may not be enough qubits to hold some of the intermediate results. 
The algorithm is: 

3.1. The odd cyclic quantum Fourier transform algorithm. 

7V-1L-1 

(10) N|0) ^ 

/I 1 \ multiply 1 I . , 

(11) > --^2^u,\t+jN) 



(12) = \ul) 



M-1 



(13) ^ T^EE"^'^^"^'!^) 

(14) ^ 4^E-EE-M^"^^"L^^^^i^)i^+") 

^ i,j s=0 tGCs 

(15) = 47E"^-5^i^)\/^EE4?^""*"'^'i^+") 



N ^ V LM 

i,s=o teCa j=o 

(16) = \v) 

\ul) is the vector that is L copies of the coefficients from \u), normalized. \v) is 
the algorithm output. 

Notice that Fp^\u) appears in the output in line |l^, but the rest is unfortunately 
dependent on s and i. However the dependence is small: if were the same for all 
s, if the 6s, which are bounded in magnitude by i, were actually zero, and if the i 
dependence were dropped, then the output would leave Fn\u) in the first register. 
The paper shows this is approximately true, and quantifies the error. 

4. Initial bounds 

We need many bounds to reach the final theorem, which we now begin proving, 
jrj makes the mistake of missing the —1 in the following lemma^; using a 

^Recall logs are base 2. 

^Note it may be more efficient to apply the Hadamard operator H to each qubit in |0). 

^Using the definitions in [Q, a — ^ instead of -1 is sufficient. Even then, however, M = 128, 
N = 37, i = 12, and fc = 40 shows the error. Compare our lemma ^ to the proof of claim 4, 
section 9.2.3, in M. 
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different definition for the {i), is correct in dropping the —1. To avoid these subtle 
errors we thus prove 

Lemma 5. For integers N > 2, M > 2N , and any i e {0, 1, . . . , iV - 1}, k G 
{0, 1, . . . , Af — 1}, with k ^ (i), we have 

M 



(17) 



M 



^ 27V ^ 



Proof. The sets (i) are disjoint, so wc do two cases. If i = 0, tlicn k <^ (0) impUes 
(18) 

from which it follows that 
(19) 



Ml Ml 

<k< M h - 

2N 2 - - 2A^ 2 



M 



M 



M 1 M 
- 2N ^ 2^ 2N 



1 



If i 7^ 0, then either k is less than the integers in (i) or greater than the integers 
in (i), giving two subcases. Subcase 1: 



(20) 

implying 
(21) 



< fc < 
M 

2N " 



M 



M 1 M M 

h-< — i hi 

2N 2 - N 2N 



M , M M 

1 < — i~k< — i < M 

- N - N - N 



which gives the bound. Subcase 2 is then 

M " 



(22) 
which implies 
(23) 



M M 

— i H 1 < 

N 2N 



N 



2N 2- - 



M 
2N 



Kk 



M M 



giving the bound and the proof. 



□ 



We now bound many of the \A^) coefficients. Our bound has a factor of tt not 
in j?! and making it somewhat tighter, and wc avoid special cases® where the 
statement would not be true. 

Liemma 6. For k e {0,1, M - 1} and i e {0,1, N - 1}, with ± - not 
an integer, then 

(24) 



MM ^ , A/ 2 



Proof. We rewrite from the definition 



(25) 
(26) 



^ Vlmn 



LN-1 
a=0 



a(fc-M,) 



||7| and [Q missed these cases by not placing any restriction such as our hypothesis that 
— -j^ is non-integral. Compare our lemma ^ with Observation 2, section 9.2.3 in and with 
Observation 1, section 3.1, in Mi- 
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which is a geometric series. By hypothesis, uj\j " 7^ 1, so we can sum as^ 



(27) 



Al 



1 



VLMN 



l-UJ 



LN{k~M.i) 



M 



1 -UJ 



M 



The numerator is bounded above by 2, and the denominator satisfies 



(28) 
(29) 

These together give 
(30) 



1-UJ 



(fe-#0 



M 



TT \k - 



Al 



< 



> 



M 



M 



n''\m 



M 
2 



□ 



Note our initial requirement that (M, N) = 1 is strong enough to satisfy the 
non-integral hypothesis in lemma ^, except for the case i = k = 0, which we will 
avoid. 

Next we bound a sum of these terms. We fix 7 = ^ — |j for the rest of this 
paper. 

' ^ For 



Lemma 7. Given integers N > 2 and M > 2N , with N odd. Let 7—3 m 
a fixed integer k e {0, 1, . . . , ilf — 1}, 

(31) E 



1 



i=0 



\k - 





N - 1 








) 









Proof. The minimum value of the denominator is at least ^ — 1 by lemma and 
the rest are spaced out by but can occur twice^° since the denominator is a 
sawtooth function going over one period, giving that 

N-1 



(32) 

(33) 

(34) 
(35) 



E 

i=0 



1 



I k -^i 



< 



2E 



1 



M 

a=0 2N 



-1 + 



< 




□ 



Without this requirement, the sum would be LN, much different than the claimed sum. The 
hypotheses avoid the resulting divide by zero. 

^'^Both |i| and Iq] appear to overlook this fact. 
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The generality of the above lemma would be useful where physically adding more 
qubits than necessary would be costly, since the lemma lets the bound tighten as |j 
decreases. However the following corollary is what we will use in the final theorem. 

Corollary 8. Given integers N >l'i and M > 16N , with N odd. For a fixed value 

fce{0,i,...,M-l}, 



(36) 



N-l 

E 

km 



1 



< 



M 



M 



Proof. Using lemma M > 16N gives ^ < and 



-+ln 

7 



N - 1 



27 



(37) 

(38) 

(39) 
(40) 

where the last step required N > 



16 

y 



< — + In 



8{N - 1) 



In 



7 

8(iV^ 



1) 



7 



< In 



< 



2\nN 



e-N 



8 ^ 

I e 7 



> 11.2. The corollary follows. 



□ 



claimed an incorrect bound of 



correct 



4jV In TV 
M 



2 N In N 
M 



in section 9.2.3, and [pi obtained the 



in section 3.1, but both made the errors listed above. 



Next we prove a bound on a sum of the above terms, weighted with a real unit 
vector. This will lead to a bound on the tails || . Our bound has an extra 

term compared to the claimed bounds in and 1^, but corrects an error in their 
proofs. 

Lemma 9. Given integers N > IZ and M > 16N , with N odd. For any unit vector 



(41) 



M-l 

E 

A;=0 



N-l 

E 

1=0 



N^\ 



M 



< 



22NlnN 32iV3 



M 



Proof. We split the expression into three parts, the first of which we can bound 
using methods from |^ and |8|, and the other two terms we bound separately. 

Using the A operator from definition ^ along with the values a and (3 defined 
there, and using lemma ^, wc can rewrite each k with k = t+ \_j^k'~\ = t+ j^k' + 5s. 
Since s differs from k' by a multiple of and the \x\]\i function has period M, 
in \^{k' — i) + t + Ss \ wc can replace k' with s. Rewrite the left hand side of 
inequality El^ as 



(42) 



M-l 

E 

k=0 



N-l 

E 

i=0 

km 



^i\ 

n''\m 



N-l 

E 



E 



AT-l 

E- 



is~i)+t + 5s 



M 



"'^"'^This fails, for example, at M = 256, N = 13, k = 26, using either their definitions or our 
definitions. 
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Letting A/j = (s, i), note that k ^ (z) if and only if s 7^ z, which can be shown from 
the definitions and the rounding rules used earher. To simphfy notation, write 
9i s — ^ i) + t + ^s- We have not changed the values of the denominators, so 
|9*,sIm > ^ — 1 by lemma ^ for all i, (s, t) in this proof. 

We want to swap the s and t sums, but we need to remove the t dependence on 
s. Again using lemma ^ we can split the expression into the three terms"'^^: 

2 



(43) 



(44) 



(45) 



/3 N-1 

EE 

t=-/3 s=0 



N-1 

E 



E 



+ 



s with aGCs 



E 

s with — qGCs 



N-1 

E 

4=0 



i^»,«Im 



■>=0 ni.siM 



7V-1 



Next we bound the first term 43, For a unit vector a; and fixed t we rewrite the s, i 
sum as the norm of a square matrix Pt acting on x, so that the sum over s and i 
becomes 

2 



Af-l 



(46) 



s=0 



N-1 

Ig* I 

i=0 ni,slA/ 



We also define similarly to each Pt a matrix Qt which is the same except for minor 
modifications to the denominator: 

2 

N-1 



(47) 



s=0 



E 

i=0 



I A/ 



Note this matrix is circulant^'^, since each entry in the matrix only depends on s — i. 
Also each entry is nonnegative^'*. Thus the expression is maximized by the vector 
y = (1,1,...,!) as shown in each of 0|, jsj, and j^. Now we relate these matrix 
,vio T3„r.cn 1^* I., ^ 

2N 

lower and upper bounds 



expressions. Recall |q*sIm > ^ — 1 and \Ss\ < \- Set A — jj^2N- Then we find 



(48) 
and 
(49) 



1-A = 1 



1 





s 


6s 


M 






M 



'^\2N 



< 



1) 



< 





M 


1 
2 




g* 


M 



< 





M 








M 



< 1 



^\2N 



Ig* - 




|g* 


M 




1 + A 


f 1) 





-'^^The second two terms are missed in [Q and 

"'^'^That is, each row after the first is the cycUc shift by one from the previous row. 



"I"* -5s\m> \qt ,1m - 5>w-i>0 '^i"^*^ M > 37V 
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Rewriting 
(50) 



iPtxW 



N-l 

E 



N-l 

E 





4 


s 




M 




M 


(7* 


M 



and using the bounds gives 



(51) 



{l-Xf\\Qtxr<\\Ptx\\'<il + X)'\\Qtxr 



Then since y maximizes ||Qta:|| , 

(52) \\Ptxf < (1 + Xf WQtxf < (1 + A)2 WQtyf < 



1 + A 
1 - A 



\Pty\\ 



giving that we can bound the leftmost term by (^t^I j times the norm at y. i^jzj 

takes on values between 1 and y|| sa 1.33 for AI > 16N, better than the constant 
4 in § and [|. 

Combined with corollary this allows us to bound term ES: 



(53) 

(54) 
(55) 
(56) 



P N-l 

EE 



N-l 

EXi 



E 225 ^ — A 
169 ^ 

t s=0 



N-l 1 

E Vn 
k* I 

i=o 



< (2/3+1) 



225 iV /4A^ln7V 
169iV V M 



< 



< 



Af 225 /4A^lniV 
iV 169 V M 
22Ar In^ N 



M 



Now we bound the other two terms, ^ and We need the following fact, which 



can be shown with calculus: the expression 



^N-l 



subject to the condition 

J2iLo^ — 1' maximum value \Jj2iLo^ '^f- Then term ^ can be bounded 
using a similar technique as in the proof of lemma 0. Again we take 7 ^ ^ ^ 



2 M- 



(57) 

(58) 

(59) 
(60) 



E 



s with qGCs 



7V-1 



E X' 
I 



i = Q I^I^SlA/ 



< 



E 



N-l 



E^ 



< N 



2N^ / 1 



J_ 1_ 



< 



< 



1 2 ^ (1_ N_, \ 

\ a=l \2 M ^ 

"M2~ 
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Term ^is bound with the same method and result, and adding these three bounds 
gives the desired inequaUty |4^. □ 

Similar to the proof of the previous lemma, Both and § claim the following 
bound 



(61) 



E 

fc=0 



N-1 

E 

1=0 



n'-\m 



, A/-1 



N 



k=0 



N-1 

E 



leading to (in their papers) 



(62) 



A/-1 

E 

fe=0 



N-1 

E 

mi) 



I k -jTfi 



< 



GAN In^ N 



So our bound tightens their 64 to a 22, but has a new term accounting for the 
extra pieces in the proof. However, both (section 9.2.4) and (appendix C) 
had the following flaws their proofs. Both proofs rearranged the left expression to 
be bounded by a matrix norm, and then "rearranged" the matrix to be square. 
This fails due to the subtle nature of the A operation they implicitly used. They 
claimed the resulting matrix differed only slightly from their previous one, which 
is false, since many terms may have to be changed from to a large value. They 
relied on the resulting matrix being circulant and being close to the to their initial 
expression, which it is not due to these extra terms. Our proof above is based on 
their methods, but avoids the errors they made by pulling out the incorrect terms 
and bounding them separately, resulting in the extra term in our bound. 

We now use these lemmata to bound the tails The bound 

was claimed in [Q, section 9.2.3, and section 3.1, but our new terms from lemma 
^ give us a more complicated bound: 

Lemma 10. Given three integers: an odd integer N > 13, L > 2 a power of two, 
and M > \QN a power of two, then 



(63) 
Proof. 

(64) 

(65) 
(66) 



N-l 



i=0 



< 



2 22\n^ N 32iV2 



LM 



N-l 



1=0 



A/-1 

E 

fc=0 



JV-1 



1=0 

mi 



< 



< 



E 



E 

^ N-l 

E 



AM 



ir^LN 



. 1=0 



\k - 



AM /227V In TV 32N^ 



TT^LN 



M 



A/2 
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Taking square roots gives the result. Note that the requirements of lemma |^ are 
satisfied when obtaining line since we avoid the k ^ i = case, and {M, N) = 
1. □ 

Next we show that the shifted are close to the which will allow us to 
show the algorithm output is close to a tensor product. This mirrors Q claim 5, 
section 9.2.1, and |^ claim 2, section 3. In both cases their constant was 4, where 
we obtain the better bound 



Lemma 11. 

(67) 



\S') - IB' 



< 



ttLN 



Proof. RccaU \S') = A°b-^' „.od a/I&> and \B') ^ll^)- It is important 

these are supported on the same indices! Also recall that \A^) = FMF£j^\Li) and 
that Fi\i is unitary. Then (dropping mod M throughout for brevity) 



(68) \S') - \B' 

(69) 
(70) 
(71) 
(72) 

and this can be bounded by 
2TraS.j 



< 



\Y,AU,\b)-Y,Al\b) 

he(i) 6e(j) 

M-l M-l 



fe=0 



k=0 



. k=0 



LN-1 

E 

a=0 



/LN 



-Ul 



— E 

LN ^ 



a=0 



1 ad, 
1 -'^M 



N 



(73) 



1 



E 

a=0 



M 



< 



LNM"^ 



LN~1 

E«^ 

a=0 



< 



{LNf 



LNM^ 



Taking square roots gives the bound. 



□ 



In the above proof, to obtain line |6^ we needed that |S'*) and \B') have the same 
support, but 1 5') is a shifted version of \B^), so we implicitly needed all the sets 
(i) to have the same cardinality. This is not satisfied in ^ (although it is needed) 
but is met in 0. 

For the rest of he paper we need a set which is (0) without mod Af applied: let 
A be those integers in the closed interval [— — ij , ^ ^J] ■ Then 



Lemma 12. 

(74) 



A\S') 



'E^?l*- 



teA 
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Proof. By definition, \S'^) = Ehefo) ^bl^ + Lf *1 modM). A (6 + [ 



(the proof uses {AI,N) — 1), and A a bijection implies A|6 + [^i] mod il/) = 
|i) |6 + a) . The rest foUows^^ □ 

5. Main results 

Now we are ready to use the above lemmata to prove the main theorem. 

Theorem 13. Given three integers: an odd integer N > 13, L > 16 a power of 
two , and M > LN a power of two. Then the output \v) of the algorithm in section 
3.1 satisfies 



teA 



(75) \v)^FN\u)(E)J2^t\t + o^ 
Proof. Note 

(76) \u) := Fn\u) = J2 



2 /221n iV . 32iV2 . t:LN 



LM M\/3 



JV-l 



N-l 



ui\i) = ^ M,|yl*) 

1=0 i=0 

Using lemma n2^ and that A is unitary allows us to rewrite the left hand side as 



N~l 



(77) \v)-Y,u,A',\s)\t- 



s=0 

teCe 



(78) 



(79) 



AFm\ul) - J2 

N-l N-l 



s=0 
N-l 



s=0 



N-l 



By the triangle inequality this is bounded by 

Af-l N-l 

(80) II 



N-l 



II^^^T.r)) +||^^^.|i?^))-^^^,|^^ 

s=0 s=0 s=0 

which in turn by lemmata ^ and ^ is bounded by 
(81) 



2 /221n2iV 32Ar2 



L 



LAI mV3\ 



/El 



The last expression has |||m)|| = 1, which gives the result. Note that to obtain line 
^we needed the supports of the \B'^) disjoint, and that the \S^) and jS') have the 



same support 



16 



□ 



This shows that the output of the algorithm in section 3.1 is close to a tensor 
product of the desired output Fn\u) and another vector (which is not in general a 
unit vector). Since a quantum state is a unit vector, we compare the output to a 
unit vector in the direction of our approximation via: 



^^It is tempting to use Cninstead of A, but this is not correct in all cases. 
"'^^This is not satisfied in ItI, and the overlapping portions make that proof invalid. 
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Lemma 14. Let a be a unit vector in a finite dimensional vector space, and b any 



vector in that space. For any < e < 1, if 



:-b 



< e then the unit vector b' in 



the direction of b satisfies 



< eV2. 



Proof. Simple geometry shows the distance is bounded by y 2(1 — \/l — e^), and 

this expression divided by e has maximum value V2 on (0, 1]. The e case IS 
direct. □ 

So we only need a factor to compare the algorithm output with a unit vector 
which is Fn\u) tensor another unit vector. We let \tjj} denote the unit length vector 
in the direction of X)teA I* + ^^^^ ^^^^ paper^^. 

For completeness, we repeat arguments from ^ to obtain the operation com- 
plexity and probability distribution, and we show concrete choices for M and L 
achieving a desired error bound. 

To show that measuring the first register gives measurement statistics which are 
very close to the desired distribution, we need some notation. Given two probability 
distributions V and V over {0, 1, . . . , M- 1}, let \V-V'\ = Efclo^ Pi^^) " '^'i'^. 
denote the total variation distance. Then a result^^ of Bernstein and Vazirani 
states that if the distance between any two states is small, then so are the induced'^^ 
probability distributions: 

Lemma 15 ([Q, Lemma 3.6). Let \a) and |/3) be two normalized states, inducing 
probability distributions Va and . Then for any e > 

(82) \\\a)-\p)\\<e^\V^-Vfj\<2e + e^ 

independent of what basis is used for measurement. 

Combining this with theorem |l^ and lemmata |l^ and 15 gives the final result 

Theorem 16. 

1) Given an odd integer N > 13, and any V2 > e > 0. Choose L > IQ and 
M > LN both integral powers of 2 satisfying 



2 /221n^A^ 327V2 nLN e 



Then there is a unit vector \ip) such that the output \v) of the algorithm in section 
^■l| satisfies 

(84) \\v) ~ Fn\u) ® < e 

2) We can always find such an L and M by choosing 

(85) L = c^^ 



(86) M = C2^ 



^^The subscripts in are taken modAf . 

^^Their statement is a bound of 4e, but their proof gives the stronger result Hsted above. We 
choose the stronger form to help minimize the number of qubits needed for simulations. 
^^The induced distribution from a state |</<) is I'(fc) = |(fc|(/>)p. 
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for some constants ci,C2 satisfying 

(87) 65 < ci < 2 X 65 

735 < C2 < 2 X 735 



3) The algorithm requires [log M~\ + 2 qubits. By claim 2 a sufficient num- 
ber of qubits is then 12.53 + 3 log . The algorithm has operation complexity 
0(logM(loglogi\/ + logl/e)). Again using claim 2 yields an operation complexity 
of 

(89) O l^log ^ l^log log ^ + log ^ 

4) The induced probability distributions T>y from the output and T> from Fn\u) (g) 
satisfy 

(90) \Vy - V\ < 2e + e'^ 

Proof. Claim 1 follows directly from theorem |l^ and lemma Claim 1 and lemma 
pis] give claim 4. 

To get claim 2, note that for the bound to be met, we must have '"^^ < e^, 

< e^, and < e. Trying to keep M small as N and e vary leads to the forms 
for L and M chosen. If we substitute lines ^ and ^ into |8^ and simplify, we get 



, , 4 /llln^iV 16e3 TrV2 a 
91 -\ ^ + + ^-<l 

TT y CiVN ClC2 V3 C2 

The left hand side is largest when e = V2 and = 55, so it is enough to find 
constants ci and C2 such that 



4 / 11 111^55 32^/2 nV2ci , 
(92) -W ^ + ^_ + ^^<l 

TI" V ClV55 ClC2 V3 C2 

Ultimately we want L and M to be powers of two, so we find a range for each of 
ci and C2 such that the upper bound is at least twice the lower bound, and such 
that all pairs of values (ci,C2) in these ranges satisfy inequality ^ To check that 
the claimed ranges work, note that for a fixed ci, the expression increases as C2 
decreases, so it is enough to check the bound for C2 = 735. After replacing C2 in 
the expression with 735, the resulting expression has first and second derivatives 
with respect to ci over the claimed range, and the second derivative is positive, 
giving that the maximum value is assumed at an endpoint. So we only need to 
check inequality at two points: (ci,C2) = (65,735) and (2 x 65,735), both of 
which work. Thus the bound is met for all (ci,C2) in the ranges claimed. With 
these choices for M and L, note that L > 16 and M > LN C2 > eci, which is 
met over the claimed range, so all the hypo thesis for claim 1 arc satisfied. 

Finally, to prove claim 3, algorithm 3T and the proof of lemma ^ give that we 



need [log A^] qubits in the first register and max{[logL] , [log(2a + 1)]} qubits in 

M 
N 



the second register. L < ^ < 2a + 1 gives that it is enough to have [log(2a + 1)] 



qubits in the second register. Then 2q! + 1 < + 2 gives 

(93) [log(2a + 1)1 < [1 + log M - log A^] = 2 + [log M] - [log A^] 
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Thus [log M~\ + 2 is enough qubits^" 
giving [logAf] 



for the algorithm. By claim 2, we can take 



M < 2 X 735 

As noted in 
the Fm 



2 < 



12.53 + 3 log ^ 



1^] and the most time consuming step in algorithm 3.1 is 
Fourier computation. Coppersmith |^ shows how to e approximate the 
quantum Fourier transform^^ for order M = 2™ with operation complexity of 
0(log Af (loglog Af + log 1/e)). Using this to approximate our approximation within 
error e gives the time complexities in claim 3, finishing the proof. □ 

6. Conclusion 

These bounds allow simulation for many choices of N and e. However the choices 
for M and L given in theorem ^ can usually be improved, and were merely given to 
show such values can be found. For example, the following table shows, for different 
N and e combinations, a triple {g, m, I) of integers, with the choice from line |8^ 
being AI = 2^; yet in each case M = 2™ and L = 2' is the pair with minimal m 
satisfying the hypotheses for theorem ^ Thus choosing M and L carefully may 
allow lower qubit counts, such as the = 13, e = 0.10 case. 



e 


N=13 


N=25 


N=51 


N=101 


N=251 


N=501 


.001 


45,45,28 


47,47,28 


48,48,29 


50,50,29 


52,52,30 


53,53,30 


.01 


36,35,21 


37,37,22 


38,38,23 


40,40,23 


42,42,23 


43,43,24 


.05 


29,28,17 


30,30,17 


31,31,18 


33,33,18 


35,35,19 


36,36,19 


.10 


26,25,15 


27,27,15 


28,28,16 


30,30,16 


32,32,17 


33,33,17 


.20 


23,22,13 


24,24,13 


25,25,14 


27,27,14 


29,29,15 


30,30,15 


.30 


21,20,12 


22,22,12 


24,24,12 


25,25,13 


27,27,13 


29,28,14 


.40 


20,19,11 


21,21,11 


22,22,12 


24,24,12 


26,26,13 


27,27,13 



Table 1. Values 



We also simulated this algorithm for the combinations above requiring 22 or 
fewer qubits. The first test computed the algorithm error on random input vectors 
(statcs)^^. The middle set of columns in table ||, where (M, L) = (2™, 2'), shows 
the maximal error observed in the column labelled "observed e" over 100 random 
vectors. Note that the observed error is much smaller than the required bound; for 
example, with = 25, e = 0.3 the max observed error is actually 0.0182. This led 
to the second set of experiments, with results in the third set of columns in table 
^ where we tried all legal M, L combinations until we found the one with smallest 
AI value that met the desired error bound, when tested over 5000 random vectors. 
This seemed to show that the qubit cost could almost be cut in half in practice. As 
a final test, for A^ = 501 and e = 0.2, the theorem requires 30 qubits, but empirical 
testing showed 15 suffices for our 5000 test vectors, which had a maximal error of 
0.18. These results show that it is likely that significant tightening of the bounds 
presented here is possible, resulting in qubit savings. 

^'^An example requiring [log M] + 2 qubits is M = 1024, A'^ = 65, so the bound is tight. 

^-'^Many authors give a simple quantum circuit doing the quantum Fourier transform over a 
power 2"* with time complexity 0{m?); see for example jsj Chapter 5 and endnotes]. However, 
this requires m elementary operations, which seems a little like cheating. Requiring a finite fixed 
number of elementary operations would give a time complexity of O(m^). 

^^The left hand side of line b4 is the error computed. 
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N 


e 


m 


I 


observed e 


best m 


best I 


£2 


13 


0.4 


19 


11 


0.0362329 


9 


4 


0.353615 


13 


0.3 


20 


12 


0.0409662 


10 


4 


0.212023 


13 


0.2 


22 


13 


0.0187127 


11 


4 


0.158535 


25 


0.4 


21 


11 


0.0193478 


10 


4 


0.309438 


25 


0.3 


22 


12 


0.0181997 


11 


4 


0.193214 


51 


0.4 


22 


12 


0.0332493 


11 


4 


0.294778 



Table 2. Simulation results 
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